An Integral Projection-based Semantic Autoencoder for Zero-Shot Learning

نویسندگان

چکیده

Zero-shot Learning (ZSL) classification categorizes or predicts classes (labels) that are not included in the training set (unseen classes). Recent works proposed different semantic autoencoder (SAE) models where encoder embeds a visual feature vector space into and decoder reconstructs original space. The objective is to learn embedding by leveraging source data distribution, which can be applied effectively but related target distribution. Such embedding-based methods prone domain shift problems vulnerable biases. We propose an integral projection-based (IP-SAE) projects concatenated with latent representation force reconstruct visual-semantic Due this constraint, projection function preserves discriminatory inside enriched forces more precise reconstitution of invariant manifold. Consequently, learned less domain-specific alleviates problem. Our IP-SAE model consolidates symmetric transformation for projection, thus, it provides transparency interpreting generative applications ZSL. Therefore, addition outperforming state-of-the-art considering four benchmark datasets, our analytical approach allows us investigate distinct characteristics generative-based unique context zero-shot inference.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Class label autoencoder for zero-shot learning

Existing zero-shot learning (ZSL) methods usually learn a projection function between a feature space and a semantic embedding space(text or attribute space) in the training seen classes or testing unseen classes. However, the projection function cannot be used between the feature space and multi-semantic embedding spaces, which have the diversity characteristic for describing the different sem...

متن کامل

Zero-Shot Learning for Semantic Utterance Classification

We propose a novel zero-shot learning method for semantic utterance classification (SUC). It learns a classifier f : X → Y for problems where none of the semantic categories Y are present in the training set. The framework uncovers the link between categories and utterances through a semantic space. We show that this semantic space can be learned by deep neural networks trained on large amounts...

متن کامل

Preserving Semantic Relations for Zero-Shot Learning

Zero-shot learning has gained popularity due to its potential to scale recognition models without requiring additional training data. This is usually achieved by associating categories with their semantic information like attributes. However, we believe that the potential offered by this paradigm is not yet fully exploited. In this work, we propose to utilize the structure of the space spanned ...

متن کامل

Semantic Graph for Zero-Shot Learning

Zero-shot learning aims to classify visual objects without any training data via knowledge transfer between seen and unseen classes. This is typically achieved by exploring a semantic embedding space where the seen and unseen classes can be related. Previous works differ in what embedding space is used and how different classes and a test image can be related. In this paper, we utilize the anno...

متن کامل

Semantic Softmax Loss for Zero-Shot Learning

A typical pipeline for Zero-Shot Learning (ZSL) is to integrate the visual features and the class semantic descriptors into a multimodal framework with a linear or bilinear model. However, the visual features and the class semantic descriptors locate in different structural spaces, a linear or bilinear model can not capture the semantic interactions between different modalities well. In this le...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3303640